Picture for Freda Shi

Freda Shi

Chartographer: Counterfactual Chart Generation for Evaluating Vision-Language Models

Add code
May 26, 2026
Viaarxiv icon

Real Images, Worse Judgments: Evaluating Vision-Language Models on Concreteness and Imagery

Add code
May 26, 2026
Viaarxiv icon

From Seeing to Thinking: Decoupling Perception and Reasoning Improves Post-Training of Vision-Language Models

Add code
May 19, 2026
Viaarxiv icon

Translation or Recitation? Calibrating Evaluation Scores for Machine Translation of Extremely Low-Resource Languages

Add code
Mar 26, 2026
Viaarxiv icon

A Very Big Video Reasoning Suite

Add code
Feb 24, 2026
Viaarxiv icon

From Tokens to Numbers: Continuous Number Modeling for SVG Generation

Add code
Feb 02, 2026
Viaarxiv icon

Autonomy Matters: A Study on Personalization-Privacy Dilemma in LLM Agents

Add code
Oct 06, 2025
Figure 1 for Autonomy Matters: A Study on Personalization-Privacy Dilemma in LLM Agents
Figure 2 for Autonomy Matters: A Study on Personalization-Privacy Dilemma in LLM Agents
Figure 3 for Autonomy Matters: A Study on Personalization-Privacy Dilemma in LLM Agents
Figure 4 for Autonomy Matters: A Study on Personalization-Privacy Dilemma in LLM Agents
Viaarxiv icon

From Behavioral Performance to Internal Competence: Interpreting Vision-Language Models with VLM-Lens

Add code
Oct 02, 2025
Viaarxiv icon

Distribution Prompting: Understanding the Expressivity of Language Models Through the Next-Token Distributions They Can Produce

Add code
May 18, 2025
Viaarxiv icon

DriveSOTIF: Advancing Perception SOTIF Through Multimodal Large Language Models

Add code
May 11, 2025
Viaarxiv icon